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The present invention relates to planetary gear 
systems used in automatic transmissions. 

Japanese Patent Provisional (unexamined) 
Publication -No. 50-64660 -discloses— one conventional 
example. The planetary gear system of this 
conventional example has three single planet type 
gear sets, as shown in Fig. 47 of the drawings. The 
gear train shown in Fig. 47 has a first rotary member 
Ml composed of second and third sun gears , a second 
rotary member M2_f ormed_by~ a_-third_-carrier,-_.a third 
rotary member M3 composed of a first ring gear, a 
second carrier and a third ring gears, -a fourth 
rotary member M4 composed of a second ring gear and a 
first carrier, and a fifth rotary member M5 formed by 
a first sun gear. The gear system further includes a 
connecting means which connects each of input shaft, 
output shaft, and case (or housing) with one or more 
of the five rotary members directly or through a 
selective engaging element such as a clutch or a 
brake. The input shaft is connected to the fifth, 
third and first rotary members M5, M3 and Ml, 
respectively, through first, second and third 
clutches CI, C2 and C3. The fifth, fourth and third 
rotary members M5, M4 and M3 are connected with the 
case through first, second and third brakes Bl, B2 
and B3 . The second rotary member M2 is connected 
directly to the output shaft. This gear system 
provides six forward speeds including a direct drive 
speed, and two reverse speeds by engaging two of the 
selective engaging elements simultaneously, as shown 
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A Virtual Web Caching System 



Background of the Invention 

This invention relates to web cache servers. 

One of the major problems that needs to be tackled as the World Wide Web 
continues its exponential growth is the severe congestion that Web users are experiencing. 
Although increasing network bandwidth would help, as more information services are 
moved onto the Web the demand for bandwidth is likely to continue to outstrip supply for 
the foreseeable future. 

Caching has been widely recognized as one of the solutions to this congestion 
problem, because caching can substantially reduce latency and improve bandwidth 
efficiency. With extensive use of caching, along with pre-fetching, replication and 
mirroring, significant traffic reduction can be achieved on bottleneck links. 

Caching may take place at various points between a user and the data source. For 
example, most modern browsers have built-in caching which keeps recently accessed web 
pages in the PC itself Proxy caching was incorporated in the CERN httpd, which is one 
of the oldest Web servers. A CERN server can also chain to another caching server. Most 
second-generation Web servers, such as Apache, Jigsaw, and NetSite support similar 
caching functions. A comprehensive report on Web caching can be found in A. Cormack, 
Web Caching, Technical Report, University of Wales, UK, Sept 1996. 

Another important work on caching, known as the Harvest Cache, is described by 
A. Chankhunthod, P. Danzig, C. Neerdales, M. Schwartz, and K. Worrell, in 
A hierarchical Internet object cache, Usenix'96, Jan 1996. The Harvest cache possesses a 
hierarchical caching architecture in which cache servers that are positioned at selected 
points of a large network co-operate with each other using Internet Cache Protocol (ICP). 
When A Harvest cache server receives a request for an object that is not cached locally, 
the server queries its neighbors and hierarchical parents to find the sought object. If that 
object is not found, the cache server directs the request to the object's source. 

After the Harvest project ended August 1996, the development of the Harvest 
cache has been carried out by the Harvest Developers Group as a commercial product, and 
by National Laboratory for Advanced Networking Research (NLANR). A report on this 
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project can be found in http://www.nlanr.net/Cache/. A similar system was proposed by 
R. Malpani, J. Lorch, D. Berger, in Making World Wide Web Caching Servers 
Cooperate, 4th International World Wide Web Conference, Boston, USA, Dec 1995, 
where multicasting is used for querying neighbor servers. 

Although the Harvest cache represents an advance in the art it, nevertheless, has a 
number of problems. 

One problem is that for each local cache "MISS", the cache server has to send a 
query message to all its neighbors and parents, and receive a reply from each of them. 
This represents an overhead of about 200 bytes per neighbor or parent. In a large Harvest 
cache system with many cache servers or a deep hierarchy, this overhead is high, and 
represents a particularly inefficient use of resources when the requested object is small. 
Additionally, the request/reply interaction introduces some latency during the resolution. 

Another problem of the Harvest cache is that the search for a cached object is done 
in a distributed fashion. For each query, all neighbors and parents have to search their 
caches in parallel, no matter whether the results is a "HIT" or a "MISS" (and a "MISS" is 
certain for all but, perhaps, one server). Since a final "MISS" cannot be concluded until 
the slowest cache server responds or the waiting timer expires, a low hit rate at any one of 
the cache servers slows down the entire system. 

Still another problem of the Harvest cache is that there is no explicit cache 
placement policy. An object may be cached at any of the cache servers. This lack of 
policy has a number of implications. For example, objects from a same Web site may 
spread over all cache servers, which forces all cache servers to be searched in order to find 
out whether an object is cached or not. 

Summary 

An improved arrangement is realized by interconnecting a plurality of cache 
servers, for example, with high speed and high capacity connection, with at least some of 
the cache servers being connected to a data network, e.g., the Internet. Each cache server 
includes a selection module that determines whether the cache server can service an 
incoming request for information, whether the request ought to be directed to another one 
of the cache servers, or whether the request be routed to the site from whence the 
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information is requested. In making this determination, in accordance with one 
embodiment, the module consults a table that associates sites or/and sub-sites with 
specific ones ofthe cache servers. In another embodiment, the determination is made by 
translating the address of the site, or sub-site, or web page that is requested; for example, 
with a hash function. 

The policy for placement of information for particular cache servers is 
cooperatively arrived within an information caching module at by the plurality of cache 
servers to comport with the selection module. Accordingly, for example, in the 
embodiment where a hash function is used to look for some site, the same hash function is 
used to determine which cache server is to cache information from a given site. In the 
embodiment where a table is used, various techniques can be employed to optimize 
operations, such as, for example, caching information at a server that is physically close to 
the community of users of the cached information. 

Brief Description ofthe Drawing 

FIG. l^resents a block diagram of a virtual cache server in accordance with one 
embodiment of thisip-vention; 

FIG. ^llustratgs^tliedi vision of the Internet network by the interjection of a virtual 
cache server; an< 

FIG. S^depicts a flow diagram of the process carried out in the virtual cache server. 

Detailed Description 

The cache arrangement disclosed herein is characterized by two attributes: an 
efficient search policy, and the existence of a placement policy that works well with the 
search policy. The search policy disclosed herein provides that a data request which 
arrives at a cache is either served by the cache (because the requested data is found in the 
cache), is directed to a cache that may contain that data, or is directed to the source of the 
requested data. The specific placement policy is not critical, as long as some policy 
exists. Of course, some placement policies are better than others. 



y 



4 



Zheng Wang 1 

The following discloses a number of embodiments that have different placement 
policies. These embodiments are merely illustrative, and other embodiments can easily 
suggest themselves to a skilled artisan. 

FIG. 1 presents a block diagram of an arrangement in accord with the principles 
5 disclosed herein. Illustratively, four servers are shown (10, 1 1, 12, and 13) which are 
interconnected by high speed, high capacity, links 20-25. To the outside "world", the 
FIG. 1 arrangement appears like a single virtual cache. It is expected that this virtual 
cache arrangement will serve some select local area in a manner not unlike that of a LAN. 
That is, the plurality of interconnected networks that collectively are known as "the 

10 Internet network" might have a number of virtual caches like the one depicted in FIG. 1 . 
Such caches might be dispersed throughout the Internet network. Relative to a given 
virtual cache, however, the Internet network can be thought of being divided into a sub- 
net of the Internet network to which access is gained by going through the virtual cache of 
FIG. 1, and the remainder of the Internet network (also a sub-network). This is illustrated 

15 in FIG. 2 with sub-nets 200 and 300, and virtual cache 250 coupling the two sub-nets. 
When an Internet user 201 requests an object, such as a web page, from a site in sub-net 
300, for example, the web page "www.lucent.com/welcome.html/" from site 
http://www.lucent.com (301), the request is routed to virtual cache 250. When the request 
arrives at virtual cache 250, the cache satisfies the request, if it can, or routes the request 

20 to site http://www.lucent.com itself. As an aside, in the context of this disclosure, a "site" 
is represented by an http address that does not include an appendage of the form "/aaa". 
Accordingly, "http://www.lucent.com" represents a site. A site can have directories, and 
subdirectories, and both the directories and the subdirectories can have files, which 
represent web pages. The address "http://www.lucent.com/welcome.html/" is the address 

25 of a web page (because it was so stated). The address format of a directory, or a 

subdirectory of a site does not differ from that of a web page. Nevertheless, it should be 
recognized that there is a difference between a sub-site (directories and subdirectories) 
and a web page. 

The basic methodology, depicted in FIG. 3, is one where a request comes into the 
30 virtual cache, and more specifically to some cache in the plurality of cache server (block 
110). A determination is made in block 1 1 1 whether the request can be serviced by the 
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cache server that received the request from the Internet. If so, control passes to block 112. 
Otherwise, control passes to block 1 13 which determines which, if any of the other cache 
servers might service the request. When such a server is identified, control passes to 
block 1 12. Otherwise, control passes to block 1 14. When control passes to block 1 12, an 
attempt is made to satisfy the request. If that attempt is successful, the process terminates. 
Otherwise, block 115 passes control to block 116 which routes the request to the site 
address contained in the request, and passes control to block 117. When control passes to 
block 1 14, a decision is made in block 1 14 as to which of the cache servers will cache 
information from the site identified in the request, and control passes to block 116. When 
block 116 routes the request to the site address identified in the request, control passes to 



Week- 117, where the request is satisfied and the information responsive to the request is 
passed to the requestor via the virtual cache. Whereupon, the cache server designated for 
caching this information (whether by virtue of information identified in blocks 111, 113, 
or 1 14) stores the information for future use. 

In one embodiment comporting with the principles disclosed herein, each server in 
the virtual cache maintains a routing table, such as routing table 101 which is associated in 
FIG. 1 with server 10. The routing table specifies the sites that are cached by each server 
in the collection of servers that make up the virtual cache. As depicted in FIG. 1, table 
101 is very simple, containing only two columns. The first column identifies a site, and 
the second column identifies the server that caches data from that site. Thus, this table is 
site-based, rather than object-based. Being more granular than an object-based table, this 
table requires fewer entries. 

When Internet user 201 is routed to virtual cache 250, the request is actually routed 
to a particular server within the virtual cache, such as server 10. Server 10 first consults 
its table 101 to determine whether it serves as the cache server for the site from which an 
object is sought. If so, it searches to find the requested object. This search can of the 
objects themselves, or of an index, or table, that is object-based. If the object is found, the 
request is serviced. If not, the request is routed to the site where the object resides, e.g., 
site 301 . If the desired site is found in table 101 but the indication is that another server 
caches that site, for example, server 1 1 , then the request is routed to the server identified 
in the table, and that server attempts to satisfy the request, if it can. If it cannot satisfy the 
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request because the particular object that is requested has not been requested before, and 
is, therefore, not found in the object-based table of server 11, then the server routes the 
request to the source itself (again, e.g., site 301), and the source site satisfies the request. 
While the requested object passes through the server that is responsible for caching the 
5 site, e.g., server 1 1 in the above example, that server captures the passing object, caches it 
for future requests, and places an appropriate entry in its object-based table. 

The placement policy, which translates to, or fashions, the contents of the routing 
tables, can differ from embodiment to embodiment. The key is that the policy is co- 
operative, in that the servers co-operate over cache placement as well as cache resolution. 

10 One policy is to simply assign all of the sites in sub-net 300 to different ones of the 

servers in virtual cache 250. The routing tables of all servers in the virtual cache would 
identically contain the assignment of sites. When the number of sites in sub-net 300 is 
reasonably small, that may be a perfectly viable approach. However, when that number is 
large, and when a large proportion of the sites is hardly ever accessed by users, then 

15 constructing a virtual cache with very large tables that need to be searched may present an 
unnecessary overhead. In some circumstances, therefore, an ad hoc may be better. In 
such an approach, whenever a request is received at a server, its associated site-based 
table is consulted, and if the site is not found, a decision is made regarding some server 
being the cache server for that site. One approach is to assign the site to the server that 

20 received the request. For example, assuming that server 1 3 receives a request that is 
destined to site 301. If, when searching through its table, server 13 fails to find an entry 
for site http://www.lucent.com, it concludes that the site has not been accessed before and, 
consequently, is not cached by any server. If server 301 has room (i.e., it has not 
exceeded some predetermined portion of its caching store or some predetermined number 

25 of cached sites) it assumes responsibility for caching site 301. Accordingly, server 13 
adds site http://www.lucent.com to its search table and broadcasts this addition to the 
other servers. The other servers add the site to their search tables as well, thereby 
accomplishing the addition of a cached site. 

Another approach for assigning responsibility for caching particular sites may be 

30 based on site proximity to the heavy users of the cached objects. For example, assuming 
that the FIG. 1 virtual cache handles all departments of a university, and that cache server 
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10 is close to the building which houses physics classes, cache server 1 1 is close to the 
building which houses psychology classes, cache server 12 is close to the building which 
houses engineering classes, and cache server 13 is close to the building which houses art 
classes department. Clearly, server 10 should cache physics related objects, which may be 
5 derived from different university Internet sites, server 1 1 should similarly cache 

psychology sites, etc. Such an assignment increases the likelihood that the first server that 
would be reached by the user would also be the site that caches the sought object. 

In accordance with another embodiment, where both the search approach and the 
placement policy differ somewhat, all but one of the servers within virtual cache 250 are 

10 assigned specific sites to cache, while the remaining server is designated as the "catch- 
all", default, server. When a request comes in to a server in an arrangement that follows 
placement policy B, the associated table is consulted, and if the desired site is found in the 
table, the request is routed (if necessary) to the appropriate server. Otherwise, the request 
is routed to the default server. The default server consults its own site-based table (which 

15 need not shared with the other servers) to determine whether it caches the desired sites. If 
so, it services the request (by searching through its object-based table, etc.). Otherwise, it 
routes the request to the specified source. 

The placement policy for this embodiment needs to identify those sites that are 
accessed often and those that are not. This is accomplished through cooperation between 

20 the servers. For example, each server can initially acquire sites that it caches in 

accordance with the ad hoc approach described above. Once a predetermined limit of 
sites is reached, the server refuses additional sites, and suggests to other servers to assume 
responsibility for caching additional sites. When all servers reach their predetermined 
limit, the default site assumes responsibility for caching additional sites. Thereafter, all 

25 servers maintain a log on how often their cache is accessed and how much storage they 
are devoting to caching objects. When a site that is cached by a non-default server is 
accessed less often than a site on the default server, the assignments are reversed. Or, 
when a server becomes too busy and is forced to shed a site, another server in the virtual 
cache needs to pick up the assignment of caching the deleted site. These tasks are part of 

30 the general load balancing that needs to take account of how often a cache is accessed and 
how much memory is occupied with cached objects. 
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Since only sites that are heavily used are admitted to the non-default servers, the 
result is a much-reduced site-based table and, consequently, a faster average cache 
response. The faster average response results from the fact that the sites that are most 
often requested are cached in servers other than the default server. Finding the proper 
cache server by looking through the shorter site-based table takes less time. 

The above-disclosed embodiments all have look-up tables; but look-up tables are 
not required. For example, the requested site can actually define the server that would 
cache it. Such defining can be accomplished by employing a hash function. A hash 
function is a function that maps a signal, such as an input string or a number, to one of M 
values. The hash function selected for the FIG. 1 arrangement would be one where M=4 
because there are four cache servers in the virtual cache. Applying the destination site to 
the hash function would yield one of 4 values, and the value thus obtained defines the 
server that is responsible for caching the desired site. Of course, as in the above 
embodiments, the particular desired object might not be found in the server's cache store 
(e.g., hard drive). 

It may be noted in passing that the disclosed approach is insensitive to whether a 
request comes to a cache server from outside the server, or from another cache server 
within virtual server 250. 

The above discussion disclosed the principles of this invention, and a skilled 
artisan can implement those principles with the help of any of the various software 
packages that are publicly available. For sake of completeness, however, the following 
presents one illustrative implementation based on the Apache cache server. 

Apache is currently the most widely used Web server software on the Internet, and 
its design has a highly modular structure which makes adding new components to it 
easier. See R. Thau, Design considerations for the Apache Server API, 5th International 
World Wide Web Conference, Paris, France, May 1996. This software can be 
downloaded from http://www.apache.org/ . 

A cache resolution/search and cache placement protocol in accordance with the 
principles disclosed herein can be implemented with a simple modification to the user 
request processing module. The current processing of a user request in Apache software 
is as follows: 
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\ check cache 
1 if object found 

return object to user 

p I else 

p 1/ if remote proxy configured 

!j> ^ forward the request to the remote proxy 

|P1/ else 

p 2> fetch directly from Web site 

o v end 
p [ return object to user update cache 
p( end 

Since Apache allows a static remote proxy to be configured, one can simply 

replace the remote proxy with a dynamic one, returned by a cache routing table lookup. 

The other modification relates to the fact that in the Apache software the server only 

caches objects from Web sites it is responsible for. Accordingly, the modified processing 

module is as follows: 

(p I check cache 
fl if object found 

return object to user 

\ else 

check cache the site-based routing table 

if cache server found and is not itself 
P ^ forward the request to the cache server 

j)^ else 

p % fetch directly from Web site 

f 9 update cache 

pt, end 
p { return object to user 
f\ end 

The processing of requests forwarded by another cache server is slightly different 

from that of requests directly from users: 

^ \ check cache 
^ 1 if object found 

f)1/ return object to originated cache server 

X f\ else 

)V fetch directly from Web site 

^ return object to originated cache server 

update cache 
p\ end 

The communication between servers can be implemented in various ways, but I 
found that the Cache Information Exchange Protocol (CIEP) is quite acceptable. CIEP is 
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currently implemented over TCP, but it need not be limited to TCP. Each CIEP packet 
contains a header, and cache routing entries. The header contains an opcode, a version 
number, a packet length field, a sequence number, and a sender host address. Currently, 
the following opcodes are defined: 

• CIEP_ADD: An announcement for new cache routing entries. Upon receiving a 
CIEP_ADD packet, a server may add the new cache routing entries in its cache 
routing table. 

• CIEP_DELETE: An announcement for deleted cache routing entries. Upon receiving 
a CIEPDELETE packet, a cache server must delete the entries from its cache routing 
table. 

• CIEP_FULL: A request for the entire current valid cache routing entries. Upon 
receiving a CIEP_FULL packet, a cache server should return a CIEP_ADD packet 
with all valid entries it is responsible for. 

• CIEP_OVERLOAD: When a cache server experiences overload for particular Web 
sites, it may send a CIEPOVERLOAD packet. Upon receiving the packet, other 
cache servers may use local private cache to reduce the load. 

The above discussion covered the question of site assignment to the cache server. 
However, that does not explicitly cover the question of how to add a server to the virtual 
cache, for example, after being offline for a while. When such a server first starts up, it 
has to establish an initial cache routing table. It may send CIEP_FULL packets to other 
servers to retrieve existing cache routing table. The server may also choose the Web sites 
for which it wants to become the designated cache server by configuration or/and by 
learning from access patterns. When a server is started, a list of Web sites can be supplied 
as parameters. The list may be derived from the logs of previous cache servers or based 
on human input. Alternatively, a server may enter a learning mode, gradually adding 
cache routing entries as it gathers request statistics. 

It is possible under certain site placement policies for a number of cache servers to 
choose the same Web sites simultaneously. This can happen, for example, when some 
Web site is not in the cache routing table but suddenly become very popular, triggered by 
a synchronizing event such as email announcements to large mailing lists. Several cache 
servers may see a sudden increase in requests for a particular Web site, and start to send 
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CEIP_ADD packets. To avoid such collision, all CEIP_ADD and CEIP_DELETE 
packets can be sent after a randomized delay period between 0-30 seconds. 

There are, of course other ways to control the load balancing interactions between 
the servers. For example, although all of the servers are hierarchically equal, one of the 
servers can be in possession of a token which makes that server slightly "more equal". At 
regular intervals the servers communicate with a server that possesses the token, and that 
server determine whether, for the sake of load balancing, a given server must, for 
example, relinquish responsibility, and another server must assume responsibility, for 
caching a given site. 

In addition to all of the above, a cache server may have a local private cache that is 
not visible to the virtual cache per se. In arrangements where a private cache is permitted, 
when a Web site is not found in the cache routing table, and there is no designated default 
server, the cache server has two options after fetching the requested object directly from 
the source Web site. It can cache the object and make it public, or it can simply put in its 
private cache. Even when a Web site has an entry in the cache routing table, a cache 
server may decide to privately cache some objects from that Web sites for certain reasons. 
For example, private cache may be used when the designated cache server becomes 
overloaded or when some Web sites require special treatment. The policies on the use of 
private cache are entirely local to the cache server involved. 

The above disclosure of various embodiments is illustrative of the principles of 
this invention, but it should be noted that various modifications can be incorporated 
without departing from the spirit and scope of this invention. For example, whereas the 
routing tables disclosed above are site-based, one can easily envision arrangements where 
some site is divided into sub-sites (for example, because of the popularity of the site), and 
the routing table would be permitted to specify sub-sites. 




